
tm: Cleanup include lookup #1991
Merged: 3 commits merged into geany:master on Nov 13, 2018

Conversation

@b4n (Member) commented Nov 10, 2018

Don't use the file's inode as the hash. Although it looks like a good idea for de-duplicating links as well, it has several issues, including non-uniqueness of inodes across file systems.

The way it was done, hashing the inode but comparing the file name string pointers, also made the hash mostly irrelevant: it just stored filenames sharing the same inode in the same hash bucket, without actually doing any de-duplication, making the whole thing a convoluted way of converting to a list.

Instead, hash and compare the filenames themselves, which, even though it doesn't handle de-duplication of links, is better than the non-functional previous code.

Also, directly build the list and only use the hash table to check for duplicates, which is both faster and gives a stable output.

See #1989
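
For illustration, here is a minimal sketch of the approach described above (not the actual tagmanager code; the helper name collect_unique is made up): a GHashTable keyed by the filename strings themselves (g_str_hash / g_str_equal) is used only to detect duplicates, while the result is built directly as a GList.

    #include <glib.h>

    /* Collect unique filenames, keeping the output order stable. */
    static GList *collect_unique(char **filenames, gsize n)
    {
        GHashTable *seen = g_hash_table_new(g_str_hash, g_str_equal);
        GList *list = NULL;
        gsize i;

        for (i = 0; i < n; i++)
        {
            /* the hash table is only used to check for duplicates... */
            if (! g_hash_table_contains(seen, filenames[i]))
            {
                g_hash_table_add(seen, filenames[i]);
                /* ...the result list is built directly */
                list = g_list_prepend(list, filenames[i]);
            }
        }
        g_hash_table_destroy(seen);
        /* prepending builds the list back to front, so reverse it once
         * at the end to keep the input (command-line) order */
        return g_list_reverse(list);
    }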

@bmwiedemann

The whole tm_file_inode_hash function can also be dropped, because it is unused.

@b4n (Member, Author) commented Nov 12, 2018

Oops, that's what you get for making changes and committing in a hurry. Fixed, and this time I built it first and ran our tests.

@bmwiedemann

There are 6-9 lines of code duplication around the g_list_prepend calls; that could be improved in a later PR.

@b4n (Member, Author) commented Nov 12, 2018

@bmwiedemann what do you mean, between the glob and non-glob versions? I don't really mind, given that the glob conditional is not trivial.

@b4n (Member, Author) commented Nov 12, 2018

I just added two extra things here:

  1. Process files in the order they appear on the command line. Before, the order was stable for a given command line but reversed, which IMO is clearly not the expected behavior; that said, basically nobody should care. (See the short example after this list.)
  2. I added a test case that verifies the processing order.
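
For illustration, a hypothetical use of the collect_unique() sketch above (the filenames are made up): the duplicate is dropped and the command-line order is preserved by the final g_list_reverse().

    char *argv_files[] = { "foo.c", "bar.c", "foo.c" };
    GList *files = collect_unique(argv_files, G_N_ELEMENTS(argv_files));
    /* files now holds "foo.c", "bar.c", in that order; a prepend-only
     * build without the final reverse would yield "bar.c", "foo.c" */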

@elextr (Member) commented Nov 12, 2018

LGBI

Is the runner.sh change really part of this, or is it a general change that possibly should be separate?

@b4n (Member, Author) commented Nov 12, 2018

Is the runner.sh change really part of this, or is it a general change that possibly should be separate?

Well, it's not really specific to this, but it's needed for this test case because it requires parsing more than one input file at once, which the current automated setup doesn't allow. So yeah, it can be used by more test cases in theory, but in practice we didn't have a use case for it until now.

@elextr (Member) commented Nov 12, 2018

but it's needed for this test case because it requires parsing more than one input file at once

That's fine then.

b4n merged commit 8b68c5a into geany:master on Nov 13, 2018
b4n added a commit that referenced this pull request on Nov 13, 2018:
Process files in the order they appear on the command line when
generating tags file, instead of a more or less random order.

Closes #1989.
b4n added this to the 1.34 milestone on Nov 30, 2018